Text Categorization with Semantic Commonsense Knowledge
ثبت نشده
چکیده
Most of text categorization research exploit bag-of-words text representation. In this approach, however, all contextual information contained in text is neglected. Therefore, capturing semantic similarity between text documents that share very little or even no vocabulary is not possible. In this paper we present an approach that combines well established kernel text classifiers with external contextual commonsense knowledge. We propose a method for computing semantic similarity between words as a result of diffusion process in ConceptNet semantic space. Evaluation on a Reuters dataset show a substantial improvement in precision of classification.
منابع مشابه
Text Categorization with Semantic Commonsense Knowledge: First Results
Most of text categorization research exploit bag-of-words text representation. However, such representation makes it very hard to capture semantic similarity between text documents that share very little or even no vocabulary. In this paper we present preliminary results obtained with a novel approach that combines well established kernel text classifiers with external contextual commonsense kn...
متن کاملWikipedia-based Semantic Interpretation for Natural Language Processing
Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, cal...
متن کاملInferential Commonsense Knowledge from Text
To enable human-level artificial intelligence,machinesmust have access to the same kind of commonsense knowledge about the world that people have. e best source of such knowledge is text – learning by reading. Implicit in linguistic discourse is information about what people assume to be possible or expect to happen. From these references, I obtain an extensive collection of semantically under...
متن کاملWikipedia-based Compact Hierarchical Semantics for Natural Language Processing
A correct semantic representation of words and texts underlies many text processing tasks such as text categorization, word sense disambiguation, and semantic relatedness assessment. It has long been recognized that computers require access to common-sense and domain-specific world knowledge in order to process textual data at a deeper level. In this paper, we present a novel representation of ...
متن کاملSemantic Understanding and Commonsense Reasoning in an Adaptive Photo Agent
In a story telling authoring task, an author often wants to set up meaningful connections between different media, such as between a text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. Expecting the user to manually...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007